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Abstract. We approach the problem of finding the sharp sufficient 
condition of the boundedness of all two weight Calderon-Zygmund op- 
erators. We solve this problem in L 2 by writing a formula for a Bellman 
function of the problem. 



1. Introduction 

1.1. Preliminaries. In this paper we give a simple Bellman function solu- 
tion of the so-called "bump onjecture" for the two weight estimates of the 
singular integral operators. 

The original (still open) question about two weight estimates for the sin- 
gular integral operators is to find a necessary and sufficient condition on the 
weights w and v such that a Calderon-Zygmund operator T : L p (u) — > L p (v) 
is bounded, i.e. the inequality 

(1.1) j \Tf\ p vdx <C j \f\ p udx V/ € LP(u) 

holds. 

In the one weight case v = u the famous Muckenhoupt condition is nec- 
essary and sufficient for (jl.ip 



\ p/p' 

< oo 



(A 2 ) sup^|Q|- i y vdx^j(\Q\- 1 j v- p '/ p dx^j 

where the supremum is taken over all cubes Q. More precisely, this condition 
is sufficient for all Calderon-Zygmund operators, and is also necessary for 
classical (interesting) Calderon-Zygmund operators, such as Hilbert trans- 
form, Riesz transform (vector-valued, when all Riesz transforms are consid- 
ered together), Beurling-Ahlfors operator. 
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The inequality (jl.ip is equivalent to the boundedness of the operator 
M v i/pTM u ~i/ p in the non-weighted LP; here is the multiplication oper- 
ator, Mpf = ipf. Denoting w = u~ p l p we can rewrite the problem in the 
symmetric form as the LP boundedness of M v i/ p TM wl / p i . 

So the problem can be stated as: Describe all weights (i.e. non-negative 
functions) v, w such that the operator M v \/ p TM wl / p i is bounded in (the 
non-weighted) LP 

Note, that this symmetric formulation is is more general than (jl.ip . be- 
cause in (jl.ip is is usually assumed that u and v are locally integrable, but 
also for (II. ip to hold for interesting operators (Hilbert Transform, vector 
Riesz Transform, Beurling-Ahlfors Transform, etc.) the function \ju also 
has to be locally integrable. 

For the interesting operators the following two weight analogue of the A p 
condition is necessary for the boundedness of the operator M v i/ p TM wl/p i: 

(1.2) sup ( IQr 1 f vdx] (\Q\~ l [ wdx^) < oo 

q V Jq ) V Jq J 

or in the symmetric form 



\ 1/p ( „ , f \ 1/p ' 
s ' wdx ) < co 



(i.3) supfiQr 1 f vdx) P (iqi- 1 [ 

Q \ jq J V Jq 

Simple counterexamples show that this condition is not sufficient for the 
boundedness. So a natural way to get a sufficient condition is to replace the 
L 1 norms of v and w in (ll.3p (or the LP and LP' norms of v 1 ^ and u> 1//p ') 
by some stronger Orlicz norms ("bumping" the LP norms). 

Namely, given a Young function <I> and a cube Q one can consider the 
normalized on Q Orlicz space L (Q) with the norm given by 

J A>0: / t f«*< 1 



1 <-*<«> I Jq \ A I |«| 

And it was conjectured (for p = 2) that if the Young functions <3?i and $2 
satisfy the condition 

m A^ r dx 

( L4 ) / ^ 7 \ < °°> 



$1,2(2) 



then the condition 



i 1 - 5 ) SU Pll W llr*i^JI W llr*o^ < 00 



<3 



L*i(Q)" n L*2(Q) 



implies that for any bounded Calderon-Zygmund operator T the operator 
M v i/ 2 TM w i/ 2 is bounded in L? . Usually in the literature a more complicated 
(although equivalent) form of this conjecture was presented, but at least in 
the case p = 2 condition (jl.5p seems more transparent Q 



lr The bump condition was also stated for p / 2, but in this paper we only deal with 
the case p — 2. 
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Condition (|1.5[) was considered in numerous papers in the attempt to 
prove its universal sufficiency for all Calderon-Zygmund operators. The 
reader can find beautiful approaches in [I], [3], [4], [5], [6], [12], [18], [19] . 
where partial results for some Calderon-Zygmund operators were proved 
(note that [19] is about maximal operator and not about Calderon-Zygmund 
operators). Finally in [13] the sufficiency of bump condition for all Calderon- 
Zygmund operators to be bounded was fully proved (and even generalized 
to all p € (1, oo)), although in formally less general situation of the weights. 

Simultaneously and independently the first version of the present paper 
[16j appeared. Slightly earlier the sketch of the approach (with the main 
ideas but without much details) was circulated as [15]. A very diligent reader 
will recognize that the approaches in the present paper (and thus in earlier 
versions [15], [16]) and in Lerner's paper [13] are very different, but still have 
something in common. This very important common point is the "coupling 
by the same cube" feature. It is the main winning idea of [13]. And it is the 
feature of the present paper (and [15], [16]) as Bellman function approach 
automatically should have this feature. 

1.2. What is done in the paper. Formally, in this paper we prove the 1? 
case of the bump conjecture using Bellman function method. As it is now 
well-known, a general Calderon-Zygmund operator can be represented as 
an average of dyadic shift and paraproducts, so it is sufficient to prove the 
estimates for such operators, and that is exactly what is done in the paper. 

However we think that the results obtained obtained in the paper that 
were used to prove the bump conjecture are of significant interest by them- 
selves; probably they are even more interesting than the solution of the 
bump conjecture. 

Let us shortly describe what is done in the paper. 

First, the Orlicz norm is not easy to work with. We introduced a lower 
bound for the Orlicz norm, which gives a more tractable, in our opinion, way 
to "bump" the averages. In particular, it allowed us to apply the Bellman 
function method. 

The application of the Bellman function method is pretty standard, with 
one little twist: one of the argument belongs to an infinite-dimensional space. 

The estimates for the Haar shifts and for the paraproducts are reduced 
to two embedding theorems, so the operators are constructively factorized 
through £ 2 . This essentially means that the bump condition is a rather 
rough one, since in more delicate two weight situations no such factorization 
appeared to be possible (at lest no factorization was found). 

Namely, it is known, see [17], that in the case of power bumps (&(t) = 
t 1+£ ) one can insert a Muckenhoupt A p weight between w~ 1 and Cv, so 
the boundedness follows immediately. For finer bumps such insertion of A p 
weights is impossible, but the constructive factorization through £ 2 can be 
considered the best next thing. 
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Finally, the main estimates are proved in very general (non-homogeneous) 
martingale settings, and can be used in more general situations. In partic- 
ular, our results give the proof of the bump onjecture for the Calderon- 
Zygmund operators on geometrically doubling metric spaces (equipped with 
a doubling measure). Indeed, since random "dyadic" lattices can be con- 
structed on geometrically doubling metric spaces, and representation of 
Calderon-Zygmund operators on such spaces as an average of Haar shifts is 
now known, everything follows from our results (see Theorem 12.31 below) . 

1.3. Orlicz norms and distribution functions. Orlicz norm is not very 
convenient to work with, so we would like to replace it be something more 
tractable. 

1.3.1. A lower bound for the Orlicz norm. Let $ be a continuous non- 
negative increasing convex function such that <3?(0) = and f + °° < +oo. 

Define 'f(s) parametrically by VP(s) = 3>'(i) when s = ^T^rm (t > 0). Then 
^f(s) is positive and decreasing for s > and s^f(s) is increasing. Moreover 
Jo s4fs) ^ +°°- Indeed, using our parameterization we can rewrite the last 
integral as 

/ i $»(() 



*(t) + *'(t) 2 J dt ' 

The first integral converges by our assumption and the second integrand has 
a bounded near +oo antiderivative g^y- 

Let w > on Q C W 1 . Define the normalized distribution function N of 
w by 

(1.6) N(t) = N$(t) = -L\{x€l: w(x) > t}\ 

Lemma 1.1. Let \£ : (0, 1] — > M+ be a decreasing function such that the 
function s i— >■ s^(s) is increasing. Let & be a Young function and let 

1 



*(s) < C&(t) where 



$(i)$'(i) 

for all sufficiently large t. Then for N = Nf 



(1.7) n * (iY):= y N(t)*(N(t))dt<C\\w\\ LHi) . 

Proof. The left hand side scales like a norm under multiplication by con- 
stants, so it is enough to show that if < 1, i.e., 



-L f = [ N(t)&(t)dt < 1 
\I I J i Jo 



then n^(iV) is bounded by a constant. Since s^(s) increases, we may have 
trouble only at +oo It is cleat that it suffices to estimate the integral over 
the set where *$>(N(t)) > &(t) but since ^ is decreasing this means that 
N(t) < C/($(i)$'(i)), so we get at most J +oc ^(t^dt and we are done. □ 
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Remark. In the above Lemma 11.11 we do not need the assumption that 

(1.8) / —, A s < oo. 

Jo s^{s) 

But in what follows this assumption will be needed, and the reasoning in 
the beginning of this section shows that for any Young function $ satisfying 
f°° (&(t))~ 1 dt < oo we can find ^/ from Lemma 1 1 . 1 1 satisfying (ll.8p . 

1.3.2. Examples. In the above section only the behavior of 3> at +oo (equiv- 
alently, the behavior of ^ near 0) was important, so we will concentrate our 
attention there. 

Let $(i) = t(lni) a , a > 1 near oo. Then 

&(t) ~ (lnt) a , $(t)$'(t) ~ t(lnt) 2a , 

so ^f(s) := (ln(l/s)) Q satisfies the assumptions of Lemma ll.lt to see that 
we notice 

ln($(t)$'(t)) ~ Int. 
If $(t) = tlnt(lnlnt) a , a > 1, then 

$'(t) ~ lnt(lnlnt) a , (t) ~ t(lnt) 2 (lnlnt) 2Q 

and ^(s) = ln(l/s)(lnln(l/s)) a works, because again \n($(t)$' (t)) ~ Int. 
Note that in both examples f Q (s^(s))~ 1 ds < oo. 

The examples of Young functions with higher order logarithms are treated 
similarly. 

1.4. Main result. Let ^1,^2 : (0,1] — > M + be as above, i.e. ^1,2 are 
decreasing, s 1— > sVl/i^s) are increasing and 

1 ds 
— — ^- < 00. 

Recall that for a weight w the normalized distribution function Nq is defined 
by 



Theorem 1.2. Let t/ie weights v, w satisfy 

(1.9) su pnWi (iV§)n W2 (iV^)<oo; 

Q 

/iere t/ie supremum is taken over all cubes Q, and is defined by (|1.7|) . 
T/ien /or any bounded Calderdn-Zygmund operator T the operator 

M vl/2 TM wl/2 

is bounded in I? . 



6 



F. NAZAROV, A. REZNIKOV, S. TREIL, AND A. VOLBERG 



2. Reductions: Haar shifts, paraproducts and embedding 

theorems 

First, let us reduce the problem to its dyadic (martingale) analogue, i.e. to 
the estimates of the so-called Haar shifts and papraproducts. 

Since a bounded Calderon-Zygmund operator can be represented as a 
weighted average (over the random dyadic grids) of Haar shifts and para- 
product and their adjoins, where the weights decay exponential in complex- 
ity of the Haar shifts, it is sufficient to get the estimates for the Haar shifts 
that grow sub-exponentially (for example, polynomially) in the complexity 
of the shifts and the estimates for the paraproducts (there is no complexity 
of the paraproducts) . 

The estimates for each operators will be in turn factored into two embed- 
ding theorem, and these embedding theorem are proved in this paper. 

The embedding theorems and so the estimates of the Haar shifts and 
paraproducts hold in very general martingale settings, 

2.1. General setup. Consider a measure space X with cr-finite measure fi 
let Ck = {Qj}j, k <G Z (or k € Z + ) be partitions of X into disjoint sets Qj, 
< (i(Q^ < oo. 

We assume that the partition £k+i is a refinement of 

Let 21 be the cr-algebra generated by all the partitions In what follows 
all functions on X we consider will be assumed to be 21-measurable. 

With respect to this cr-algebras we can define martingale averaging oper- 
ators Efc, and martingale difference operators AjJ := — + Efc+„. 

We adapt the following notation. 

chl The collection of children of / € C, i.e. if / £ C n then chl = 

{J € C n+1 : J C /}. 
chj. / The collection of children of the order k of I € C; cho(I) = {/}, 

ch fc+1 (J) = {ch(J) : J G ch fc (J)}. 
(/)/' fr / Tne avera g e of / ove r I, (f)j = f £ f{x)dfi(x); 

Ej The averaging operator, Ejf := note that 

E k = J2 IeC E r 

A j Martingale difference operator, A 7 := — E l + Yljech(i) ^ji no * e 

that A k = *£ IeCk A r 
A™ Martingale difference operator of order n, 

^-=- E i + E E r 

Jech n (/) 

Since the measure \x is assumed to be fixed we sometimes will be using 
\E\ for /j,(E) and dx for dfi(x). We also will be using I? for L 2 (fi) 

The prototypical example is X = K or R d with C being a dyadic lattice 

V. 



2.2. Haar shifts. 
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Definition 2.1. A Haar shift § of complexity n is is given by 

QeV 

where the operators § n act on A"L 2 and can be represented as integral 
operators with kernels a,Q, ||ttglloo — The latter means that for all 

C S > Q f>9)= a Q (x,y)f(y)g(x)dxdy. 

J Q 

This is a slightly more general definition than the one in [TO] , but only the 
estimate 

Q'/nll 00 — \Q\ ^ ^ essential for our construction. Note also that 
according to the definition in jlOj the complexity of the corresponding shift 
is n — 1, not n, which really does not matter; we just find our definition of 
complexity a bit more convenient. 

The estimate ||cIq||oo — \Q\ _1 means that the operators Sg are "L 1 x L 1 
normalized", meaning that 

(2-1) \ { 8 Q f ig) <\Q\Mk^. V/,,GAJ^ 

Haar shifts of complexity 1 are simply "L 1 x L 1 normalized" martingale 
transforms; martingale transform here means in particular that the sub- 
spaces Aj are orthogonal, and S can be represented as an orthogonal sum 
of the operators . 

A Haar shift of complexity n > 2 is not generally a martingale transform, 
meaning that the subspaces generally intersect, so S does not split into 

direct sum of . 

However, if one goes with step n, then the corresponding operator is a 
martingale transform, so a Haar shif of complexity n can be represented as 
a sum of n Haar shifts of complexity 1. Namely, for k = 1,2,... , n — 1 define 

C k = {C k+j : j e Z}, 

and let 

^= E V 

QeC k 

Then S = X^fc=o ^ k an< ^ eac h is a Haar shift of complexity 1 with respect 
to the lattice C k . 

Therefore, uniform estimate for the Haar shifts of complexity 1 (i.e. for the 
"L 1 x L 1 normalized" martingale transforms) gives the linear in complexity 
estimate for the general Haar shifts. 
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2.3. Paraproducts. Given the lattice C and a locally integrable function 
b, the paraproduct II = Ilf, = Hb(£) is defined as 

n/ := J> Q /)(A Q &). 

QeC 

The necessary and sufficient condition for the paraproduct to be bounded is 
that 

supl^r 1 £ ||A Q 6|||<oo. 
ReC QeC-.QcR 

In the case of dyadic lattice in M. d or, more generally in the homogeneous 
situation, when 

• * ■ t \Q\ n 

mf mf > 

-R6-C QGch(i?) | it | 

this condition is equivalent to b belonging to the corresponding martingale 
BMO space BMO £ 

2.4. Reduction to the martingale case. To reduce the problem to the 
martingale case we use the following result can be found in [U] and [lUj : 

Theorem 2.2. LetT be a C alder on- Zygmund operator inM. d . There exists 
a probability space (^,P) of dyadic lattices T> u , such that 

T = C[ [ f]2- £n S n (u)dF(u;)+ [ (U\u;) + (U 2 (u))*)dF(u;)) , 
\Jn n=1 Jn J 

where S n (uj) are Haar shifts of complexity n with respect to the lattice T>^, 
n 1,2 (u;) are the paraproducts with respect to the lattice T> w , ||n 1,2 (w)|| < 1. 

The constants C and e depend on d, \\T\\ and C alder on- Zygmund param- 
eters of the kernel of T . 



Theorem 12.21 implies immediately that the main theorem (Theorem 11.2)1 
follows from the theorem below. 

Theorem 2.3. Let the weights v, w satisfy the assumptions of Theorem 
HOI Then 

(i) For all Haar shifts S of order 1 the operators M v i/2SM w i/2 are uni- 
formly bounded in L 2 , \ \ M v 1/2 S-My, 1/2 1| < C, where C depends on 
^2, ^2, the supremum in (|1.9[) . but not on the lattice C 

(ii) For all Haar shifts §> n the operators M v i/2SM w i/z uniformly bounded 
in L 2 by Cn, where C is the constant from (i). 

(iii) Let II = TEf, be a paraproduct such that 

(2.2) iJp 1 £ ||A 7 6||L|/| < 1 VJ€£. 

/G£:/CJ 

Then the operator M^i/alLM i/a is bounded in L 2 by C, where again 
C depends on ^2, ^2, the supremum in (jl.9p . but not on the lattice 
C. 
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Remark 2.4. For the homogeneous lattices, i.e. for lattices satisfying 

inf inf f\ =: 6 > 

JeC /ech(j) \J\ 

all the normalized LP norms |/| — A^-^Hp, p € [l,oo] are equivalent in the 
sense of two sided estimates. So for such lattices condition (|2.2p means that 
||n|| < C(5). So Theorem 12.31 gives the estimates that being fed to Theorem 
E2] imply Theorem [L2j 

As it was discussed above, (i) implies (ii). Statement (i) is obtained from 
the following embedding theorem: 

Theorem 2.5. Let ^ be as above. Then for any weight w on X such that 
n *W) <°° f° r allLeC 

(2.3) ^(ATn 1 )^/)- 1 f ^{fw^dX „(L) < CH/lli, 
lec ' 

for all f 6 L 2 (w); here C = C($?) and in the summation we skip L on which 
w = 0. 

Let us see that this theorem implies the condition (i) of Theorem 12.31 As- 
sume, multiplying the weights by appropriate constants that the inequality 

(2-4) n^(Nf)n^(Nf) < 1 

holds for all I € C. Then 

KS/u; 1 ^,^ 72 )! <El^/ A /(^ 1/2 )' A /(^ 1/2 ))l 
lec 

^l/r^A^/^iMiA^ 172 )!!! 

iec 



< 



Ei'i 



1 1 

2^ M n T , (Nf) 2^' 

the second inequality here follows from "L 1 x L 1 normalization" condition 
(|2.ip . the second one from (|2.4p and the last one is just the trivial inequality 
2xy < x 2 + y 2 . 

Applying Theorem 12.51 to each sum we get that 

Ks/to 1 / 3 ,^)! < ^ + c(* 2 )|| 5 ||I) . 

Replacing / i-» i/ ', g (->• i > we get 

KS/ti; 1 / 2 ,^)! < ^ (i 2 C(* 1 )||/|| 3 + i- 2 C(* 2 )|M||) . 
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Taking infimum over all t > and recalling that 2ab = inft>o(t 2 a-\-t~ 2 b for 
a, b > we obtain 

KSfw^^gv 1 ^ < (C(* 1 )C(* 2 )) 1 /2||/|| 2 || ff || 2) 

which is exactly statement (i) of Theorem 12.31 □ 
For the statement (iii) of Theorem 12.31 we also need another embedding 
theorem. 

Theorem 2.6. Let ^ be as above. Then for any normalized Carleson se- 
quence {aj} l€T) (a j > 0), i.e. for any sequence satisfying 



sup)/^ 1 ^2 a,\l'\<l 

we get 



IeV I'ev-.l'cl 



_ (fw 1 / 2 ) 2 



lev 

where again C = C($f). 



Let us show that this theorem together with Theorem 12.51 implies state- 
ment (iii) of Theorem 12.31 Let a l = HAjbH^,. 

Again, multiplying if necessary the weight v w by appropriate constants 
we can assume (12.41). Then we can write 



1(^/^/2,^1/2)1 < £ K/^/2)^ . | (A/ft) A/ (V /2 ))I 

lev 

, v K^ 1/2 ) 7 IK) 1/2 m 1/2 \\^(9v l/2 \\i 

-2^/ f x 1/2 / s 1/2 

lev (n^JVy)) (n» a (2V?)) I^l 1 / 2 



the second inequality holds because of ([2.4p . and the last one is just the 
C auchy-S chwar t z . 

Estimating the sums in parentheses by Theorem 12.61 and 12.51 respectively 
we get statement (iii) of Theorem 12.31 □ 

3. Proof of (the Differential Embedding) Theorem 12.51 

3.1. Bellman function and main differential inequality. Let (p(s) := 

s\I/(s). Multiplying by an appropriate constant we can assume without 
loss of generality that 

r 1 l 

(3.1) / — -ds = l. 

Jo ¥>(*) 
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Define m(s) on [0, 1] by m(0) = m'(0) = 0, m"(s) = l/<p(s). Identity (|3Tj) 
implies that m is well-defined and m'(s) < 1, m(s) < s. For a distribution 



function N = Nf define 



(3.2) u(N)= (2N(t)-m(N(t)))dt = 2(w) I - m(N(t))dt; 

Jo Jo 

Note that the inequality m(s) < s implies that u(Nf) > (w)j. 

The functional u is defined on the convex set of distribution functions, i.e. 
on the set of decreasing functions N : [0, oo) — > [0, 1] such that f Q N(t)dt < 
oo. 

In what follows we can consider only finitely supported functions N, and 
then use standard approximation reasoning. Consider two distribution func- 
tions iV and Ni and let AiV = N\ — N . Denote also 

poo rod 

w := / N(t)dt, wi := / N^tfdt, 
Jo Jo 

and let 

roo 

Aw := wi - w = / AN(t)dt; 
Jo 

the motivation for this notation is that if iV and N± are the distribution 
functions of the weights w and wi, then the integrals are the averages on 
the corresponding weights. Denote also 

roo 

(3.3) w A := / \AN(t)\dt; 

Jo 

clearly |Aw| < w A . 

Let us compute derivatives of u in the direction of AiV. The first derivative 
is given by 

d f°° 
u^(iV) = ^u(N + TAN) ^ = = ^ (2 - m'(N(t))) AN(t)dt, 

so, in particular 



Therefore we can write 



(3.4) u' AN = kw a , k = k(aN), \k\ < 2. 

The second derivative in the direction AiV = N± — N is given by 

-ul N (N) = -^u(N + TAN) t= = ^ v ,(iV(t))- 1 (AiV(t)) 2 ^ 

By Cauchy-Schwarz, the integral in the right side is at least 



-1 r 



2 



N(t)V(N(t))dt I \AN(t)\dt =n(N)- 1 [j \AN(t)\dt 

n(iV)- 1 (w A ) 2 , 



2 
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SO 



(3.5) 



-u". r (JV)> 



(w A ) 2 
n(N) 



For the scalar variable f € K and the distribution function N define the 
Bellman function B(f,N) = B(f,u(iV)) where 

f 2 

B(f,u) 



u 



Computing second derivative of £> in the direction A 

r 



u 



Af 
/ 



AN 



£>fu £> uu 



U 



Af 

/ 



The Hessian is easy to compute 
(3.6) 



B ff B fu 

£>fu £> uu 



2f 



AN 



_ 2f 
2f^ 



note that this matrix is positive semidefinite. 
Since £> u = — f 2 /u 2 , we get using (|3.5p 



= (Af , AiV) we get 



u 2 n 



Thus, gathering everything and using (|3.4p we get 

r / 2 2f 

W\ + 2^ 



(3.7) 



Af 



u 

2f 2f 2 



Af 



The matrix here is obtained from the Hessian in (|3.6p by multiplying the 
lower right entry by 1 + — 1, so it has more positivity than the Hessian. 
In particular, if we divide the upper left entry of the matrix in (|3.7|) by the 
same quantity 1 + 2 ^ n , the matrix still be positive semidefinite. But our 
matrix in ()3.7p has something bigger in the upper-left corner! 
Therefore, since 



1 



1 + 



u 



2K 2 n 



-i 



u 



2«rn + u 



we get that 
(3.8) 



A ~ 2K 2 n + u 



> 



2(Af) 2 



2-2 2 n + u 



> c 



(Af) S 



n 



the last inequality holds because u < 2w < Cn. 

But (|3.8p is the exactly the differential form of the inequality we need! 



3.2. Main inequality in the finite difference form. 
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3.2.1. Dyadic case. 
Lemma 3.1. Let 



Then 



(3.9) i(0(f 1 ,u(iV 1 )) + S(f 2 ,u(iV 2 ))) -B(f,u(iV)) > £ if ~ tr 



where c is the constant from (|3.8p , (Note that fx — f = f — fa, so we can 
replace (fx — f) 2 m i/ie right side by (f% — f) 2 j 

Proof. Notice that 

(3.10) £l±fi* (£1±«) >£1±«* ( S1 + S2 )>i si *( si ); 

here the first inequality holds because ^ is decreasing and the second one 
because s^f(s) is increasing. Of course, we can interchange si and s 2 in the 
above inequality. 

Let Af := fi - f , AN := N x - N. Define 

F(r) = B(f + rAf , u(JV + taN)) + B(f - rAf , u(N - taN)) 

Taylor's formula together with the estimate (|3.8p imply that 

(3.11) F(l) - F(0) > ^(Af) 2 ( — — — + ' 



2 V ' \ti(N + taN) u(N-taN) 



for some r 6 (0, 1). 

Estimate (|3.10p implies that 



so 



n(iV) > ^n(N±TAN), 



1 1 \ 1 

+ ^TT ^77 > 



n(iV + TAiV) n(N - taN) J ~ n(N)' 
Then it follows from (13-llh that 

C (Af) 2 



F(l) - F(0) > 



2 n(N) ' 



Recalling the definition of F and dividing this inequality by 2 we get (|3.10p . 

□ 
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3.2.2. General case. Let ip and B be as above. 

Lemma 3.2. Let f, G R, a k G R+ and f/ie distribution functions N, N k , 
k = 1,2, ... ,n satisfy 

n n n 

f = ^2a k f k , N = ^a k N k , ^a fc = l. 
fc=i fc=i fe=i 

T/ien 

n i / n \ ^ 

(3.12) -B(f , iV) + ^ « fc B(f , iV fc ) > ^ • ^ f ]T a fc |f fc - f | J 

Proof. The reasoning below is a "baby version" of the reasoning used to 
prove the main estimate (Lemma 6.1) in [22J. 

For a weight a = {a k } k=1 , a k > 0, let £ p (a) be the weighted (finite- 
dimensional) £ p spaces, IMIfj^Q,) = Y^ 7 k=i a k\ x k\ p (i°°(a) is just the usual 
finite-dimensional £°°). 

Let ( • , • ) Q be the standard duality (x, y) a = Y*k=i a kX k y k - 

Define e G £P(a), e = (1,1,... ,1). 

Consider the quotient space X = ^(a)/span{e}. For x G i x {a) let 
x° := x - \\e\\j 1 1 (a) (x,e) a e, so J2k=i a k x l = °- Tnen 

(3.13) IMI^ — \\ x °\\e 1 (a) — ^H^ll^.- 

Indeed, the first inequality is trivial (follows from the definition of the norm 
in the quotient space). As for the second one, |(x,e) Q | < ||a;||^i( a ), so it 
follows from the triangle inequality that 

ll^ll^Ca) - H x ll^(a) + INI^a)!^'* 3 )! ' ll e ll^(") - 2 IMI^(a)- 

This inequality remains true if one replaces x by x — ae, a G R, so the 
second inequality in f)3. 13|) is proved. 

The dual space X* can be identified with s subspace of £°° = l°°{a) 
consisting of x* G £°°(a) such that (e, x*) a = (with the usual £°°-norm). 

So, for the vector x = (xx,X2, ■ ■ ■ x n ), x k = f k — f (notice that (x, e) a = 
there is j3 = {(3 k } k=1 , \/3 k \ < 1 such that J2 k =i a kfik = and 



n 1 1 n 

^2a k /3 k (f k - f) = \\x\\ x > -||x|| £ i (q) = - ^a k % - f | 

k=l k=l 

Define f+, f", N+, N~ by 

n n 
k=l k=l 

By Lemma |3. II 

(3.14) \(b{? + ,N + )) + B(f-,N-))) -B(f,N) > ° (f ' " 



2 V v ' " y ' ") v ' ' ~ 4 n(N) 
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We know that 

n n \ n 

f + - f = ^^oikPkik = ^afc/?fc(ffc - f) > r E afc l ffc ~~ f l 

k=l k=l k=l 

(the second equality holds because Y^"k=i a kPk = 0), so the right side of 
(|3.14|) is estimated below by 

16 n(N) k J 

Since the function B is convex 

n 

B(f+,N+) < ^2a k (l + p k )B(f k ,N k ), 

k=l 

n 

B(f-,N~) <J2 a k (l-P k )B(f k ,N k ) 

k=l 

and adding these inequalities we can estimate above the left side of (|3,14p 
by 

n 

-B(f,N) + ^2a k B(f,N k ). 

k=l 

□ 

3.3. From main inequality (13. 12p to Theorem 12.51 Fix an interval 1° 
and let I k be its children. Applying Lemma l3~2l with = (fw 1 ^ 2 ) r , N k = 

Nf and a& = we get denoting / := fw 1 / 2 

re ■ n(N%y\p\ * h j0 wV^)+ o iji • m v *r) 

Applying this formula to all children of 1°, then to their children and adding 
up the inequalities we get after going n generations down that 

Te E J^f'li <-i / °|g((/) J o^r°)+ E \i\-B{(f) r Nf) 

lech k (i°) v 1 ' 1 1 /ech„(/0) 

0<fc<n 

< Y, \I\-B({f) v Nf). 

/Gch„(/0) 

We know that B(f, N) < C^y, and since (see (JH2D) u(Nf) > (w) I we con- 
clude using the Cauchy-Schwartz estimate \(fw 1 ^ 2 ) I \ 2 < (\f\ 2 ) I (w) I that 



(fw 1 / 2 ) 2 r 

I\ ■ B({f) r Nf) < C\I\ S /J = C\I\(\f\ 2 ) I = C / 

W, J i 
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Therefore, estimating the right side we get 
iech h {i°) y 11 1 1 

0<fc<n 

Since the right side does not depend on n we can make n — > oo, and have 
the sum in the left side over all I £ £, / C 1°. 

Taking the sum over all 1° € C— m and letting m-foowe get conclusion 
of the theorem. □ 

4. Proof of (the Embedding) Theorem 12. 61 

4.1. An auxiliary function. Let ^ be the function from Theorem 12.61 
Define (p(s) := a\P(a). 

For the numbers A £ [1, 2], N € R + define 

/■JVM i 
T(A,N):=N —ds 
Jo <PW 

Lemma 4.1. The function T is convex and satisfies the differential inequal- 
ity 

dT 1 iV 2 



dA ~ 4 <p(JV) 
Proof. Differentiating the integral we get 
, . dT N 2 IN 



&4 A 2 (p(N/A) ~ 4 <^(iV) ' 

since </? is increasing and A < 2. 

To prove the convexity notice that T is linear on the lines N = kA, so 
the Hessian d 2 T degenerates. 

Differentiating (|4.ip we get 

d 2 T = 2 2A<p(N/A)-N<S(N/A) 
8A 2 (A 2 <p(N/A)) 2 

Note that the right side is positive if sip'(s) < 2ip{s) (because <p(s) > 0). 

But for our function even a stronger inequality s<p'(s) < </?(a) is satisfied! 
Indeed, since (p(s) = s^f(s) is increasing and is decreasing, then 

< (**(s))' = *(s) + a*' (a) < *(a) 

(the second inequality holds because ^ is decreasing). Multiplying this 
inequality by s we get s<p'(s) < <p(s). 

Therefore, since cp(s) > 0, we conclude that £p > 0. 

But the Hessian d 2 T is singular, and it is an easy exercise in linear algebra 
to show that a singular Hermitian 2x2 matrix with a positive entry on the 
main diagonal is positive semidefinite. □ 
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4.2. Bellman function and the main differential inequality. Let now 

N be a distribution function, and let 



T{A,N) = / T(A,N(t))dt. 
Jo 



As in Section \3. II assume, multiplying ^ by an appropriate constant, that 

-ds = 1. 



1 J_ 

o ¥>(*)' 
Then r(.A,JV(t)) < N(t), so 

/■oo 

T(A,N)< N(t)dt=:w = vr(N). 
Jo 

For f 6 R, M £ [0, 1] and for a distribution function N define the function 
B(f , AT, M) := B(f , u(M, iV)), where 

f 2 

B(f,u) = - 

and 

/■oo 

u = u(M, N) = 2 N(t)dt - T(M + 1, JV) 
= : 2w(iV) - T(M+ 1, AT). 

Note that 2w(N) > u(M,N) > w(N). 

We claim that the function B is convex. Indeed, fix a direction A := 
(Af , AN, aM) t and compute the second derivative B"^ in this direction 



d 2 ~ 

B" = —^B(f + rAf , N + taN, M + taM) 
dr z 



T = 





Af ^ 


)( 






< y 


2_ 


_ 2f 


u 




2f 


2^ 




u 3 



We get 



The Hessian 

Bs B fu 
Bf u B uu 

is clearly positive semidefinite, so the first term is nonnegative. For the 
second term notice that 

(4.2) B u = ~, < = -T* < 

because T, and therefore T is convex. Thus B'^ > 0, so B is convex. Let us 
compute the partial derivative 

dB _ n ( dT \ 

( 3) ~dM ~ ~ Bu dM -rf'\dM 
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By Lemma 14.11 

dT 1 f°° N(t) 2 





1 


> 






4 ' 






> 


U 







dt 



dM ~ 4 J f(N(t)) 



JV(t)dt / <p(N(t))dt 



4 n(iV) ' 



the second inequality here is just the Cauchy-Schwartz. Combining with 
(|4.3p and recalling that u < 2w we get 



dB 1 f 2 

(4.4 _ > 

v ' dM ~ 16 n 

This inequality (together with the convexity of B) is the main differential 
inequality for our function. 



4.3. Finite difference form of the main inequality. Let X = (f , N, M), 

X k = (f k ,N k ,M k ), (f,f fc G R, M,M k G [0,1], N, N k are the distribution 
functions) satisfy 

n n n 

fe=i fe=i fc=i 

where 

n 

^Jajfc = 1, a k > 0. 



fc=l 



Then 



n if 2 

(4.5) -£(X) + V a k B{X k ) >-■ — 

^-^ lb n 

k=l 

where n = n(N). 

Indeed, for Mq := J2k=i a kM k the main inequality (14. 4p implies 

B(f,N,M )-B(i,N,M)>^-.—. 

lb n 

The convexity of B implies that 

n 

B(f,N,M )<Y,a k B(Xk) 
k=l 

which together with the previous inequality gives us (14. 5L 
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4.4. Prom main inequality (14. 5p to Theorem 12.61 The reasoning here 
is almost verbatim the same as in Section [3.31 

For an interval let us I € C denote = {fw 1/2 ) I , Nj = Nf, Mj = 

i j r 1 £j'c/ a r> w / = ( w h> u i = u ( M /' Ar / )- 

Fix 1° € C, and let I k be its children. Applying the inequality (|4.5p with 
a k = \h\/\I% f fc = f Ik , N k = Nf k , M k = M Ik we get that 

v I ' iech(io) 

Writing the corresponding estimates for the children of 1°, then for their 
children, we get after going n generations down and using the telescoping 
sum in the right side 

a E ^v\s-\mx„H E m*w 

/ech fc (j°) v 1 ' /ech„(/Q) 

0<fe<n 

/ech„(/0) 

the last inequality holds because B > 0. 
Since 

< fJ/ Uj < f z 2 / Wj 
(the last inequality holds because u > w) and by Cauchy-Schwartz 

K/^V < <l/l 2 >>>/> 

we conclude, exactly as in Section [3.31 that 

|i|B(Jr J )<|iK|/| a > I = jT|/| a (^ > 



so 



1 a fi 



16 ^ n(N? 

/ech fc (/°) 1 

0<k<n 



\I\ < [ l/l V 
Jl° 



Conclusion of the proof is exactly as in Section 13.31 we first let n — > oo, 
and then taking the sum over 1° € C- m and letting m — > oo get the desired 
estimate. □ 

5. Concluding remarks and open problems 

5.1. One sided bumps. The famous theorem of P. Koosis |llj states that 
given a weight u on the unit circle, one can find a non-zero weight v such 
that the Hilbert transform T is bounded as an operator from L 2 {u) to L 2 {v) 
if and only if 1/u € L l . The same result hods for the maximal function, see 

eh. 
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So it is possible to have a situation when one has two weight estimates, 
but one cannot "bump" the Z^-norm of 1/u (w in our notation). This leads 
to a very natural in our opinion) question "can one "bump" only one weight 
to get the two weight estimate?" For example, can one find a reasonable 
Young function $ that the condition 



implies the boundedness inL 2 of the operator M v i/ 2 TM w i/2 for all (bounded) 
Calderon-Zygmund operators T? 

Note, that for maximal function a one sided bump condition (but with 
the bump on the "wrong" side) is sufficient. Namely, it follows from the 
result in |19] that if f°° (&(x))~ 1 dx < oo and the weights v, w satisfy 



then the operator M v i/ 2 AdM w i/2, where M. is the Hardy-Littlewood max- 
imal operator, is bounded in 1? . It is natural to remark here that in |16j 
we demonstrated this result of Perez by almost precisely the same Bellman 
function that the reader saw above. 

5.2. Estimates for general measures. A standard way to set up the 

two weigh estimate problem for integral operators is to make the change of 
variables so in the integral operator one integrates with respect to the same 
measure that is used to compute the norm in the domain. 

namely, if one defines measures fi, v, djj, = wdx, dv = vdx, then the LP 
boundedness of the operator M v i/ p TM wl / p / is equivalent (at least formally) 
to the boundedness of the operator : L p {[i) — > LP{v), where 



K here is the kernel of the Calderon-Zygmund operator T. 

In fact, everything can be interpreted absolutely rigorously. The bound- 
edness of such operators can be interpreted as uniform boundedness of the 
smooth truncations; in fact such uniform boundedness is equivalent to the 
boundedness of the bilinear form on functions with separated compact sup- 
ports, i.e. to the weakest possible notion of boundedness, see [13] . 

This setting give the most general form of the two weight problem, since 
\x and v can be general measures, not necessarily absolutely continuous, 
they even can be purely singular. So the question arises, "how one can 
bump general measures?" The approach with Orlicz spaces, or any other 
functions spaces works only for absolutely continuous measures. 




(5.1) 
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